Spectrum Estimation from a Few Entries

نویسندگان

  • Ashish Khetan
  • Sewoong Oh
چکیده

Singular values of a data in a matrix form provide insights on the structure of the data, the effective dimensionality, and the choice of hyper-parameters on higher-level data analysis tools. However, in many practical applications such as collaborative filtering and network analysis, we only get a partial observation. Under such scenarios, we consider the fundamental problem of recovering spectral properties of the underlying matrix from a sampling of its entries. We are particularly interested in directly recovering the spectrum, which is the set of singular values, and also in sample-efficient approaches for recovering a spectral sum function, which is an aggregate sum of the same function applied to each of the singular values. We propose first estimating the Schatten k-norms of a matrix, and then applying Chebyshev approximation to the spectral sum function or applying moment matching in Wasserstein distance to recover the singular values. The main technical challenge is in accurately estimating the Schatten norms from a sampling of a matrix. We introduce a novel unbiased estimator based on counting small structures in a graph and provide guarantees that match its empirical performance. Our theoretical analysis shows that Schatten norms can be recovered accurately from strictly smaller number of samples compared to what is needed to recover the underlying low-rank matrix. Numerical experiments suggest that we significantly improve upon a competing approach of using matrix completion methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bounds for the Entries of Matrix Functions with Applications to Preconditioning

Let A be a symmetric matrix and let f be a smooth function defined on an interval containing the spectrum of A. Generalizing a well-known result of Demko, Moss and Smith on the decay of the inverse we show that when A is banded, the entries of f(A) are bounded in an exponentially decaying manner away from the main diagonal. Bounds obtained by representing the entries of f(A) in terms of Riemann...

متن کامل

Bounds for the Entries of Matrixfunctions with Applications

Let A be a symmetric matrix and let f be a smooth function deened on an interval containing the spectrum of A: Generalizing a well-known result of Demko, Moss and Smith on the decay of the inverse we show that when A is banded, the entries of f(A) are bounded in an exponentially decaying manner away from the main diagonal. Bounds obtained by representing the entries of f(A) in terms of Riemann{...

متن کامل

Thy Friend is My Friend: Iterative Collaborative Filtering for Sparse Matrix Estimation

The sparse matrix estimation problem consists of estimating the distribution of an n× n matrix Y , from a sparsely observed single instance of this matrix where the entries of Y are independent random variables. This captures a wide array of problems; special instances include matrix completion in the context of recommendation systems, graphon estimation, and community detection in (mixed membe...

متن کامل

Wheat Leaf Rust Disease Severity Estimation Using Reflectance Spectrum Coding Methods

Using spectroradiometry and remote sensing techniques is an effective and rapid method in diagnosing vegetation diseases which enforced mostly by using spectral vegetation indices and statistical methods.  The present study aimed to deploy encoding technique for the reflectance spectrum of the wheat leaves to assess the severity of the Rust disease. This is unlike to the spectral vegetation ind...

متن کامل

Determination of height of urban buildings based on non-parametric estimation of signal spectrum in SAR data tomography

Nowadays, the TomoSAR technique has been able to overcome the limitations of radar interferometry techniques in separating multiple scatterers of pixels. By extending the principles of virtual aperture in the elevation direction, these techniques pay much attention in the analysis of urban challenging areas. Despite the expectation of interference of the distribution of buildings with different...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1703.06327  شماره 

صفحات  -

تاریخ انتشار 2017